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Document Image Encoding/Decoding 



FIELD OF INVENTION 

5 The present invention relates broadly to a method and system for encoding a 

document image, to a method and system for decoding a compressed document image 
stream, to a computer readable data storage medium having stored thereon code means 
for instructing a computer to execute a method of encoding a document image; and to a 
computer readable data storage medium having stored thereon code means for 
1 0 instructing a computer to execute a method of decoding a compressed document image 
stream. 

BACKGROUND 

15 

As electronic storage, retrieval and distribution of documents becomes faster and 
cheaper, digital documents are being increasingly used. Typically, documents are being 
re-typed and converted to HTML or Adobe's PDF format. Alternatively, an Optical 
Character Recognition (OCR) technique may be used to convert a hard copy of a 

20 document into a digital document. Unfortunately, these techniques are still found to be 
far from suitable for faithfully translating a scanned document into a web page, and 
much of the visual aspect of the original document is likely to be lost. 

Recently, image-based approaches to digital documents have been proposed. 
One such "image-based approach" to digital documents is to store and to transmit 

25 documents as one or more images.Traditional image compression standards such as 
JPEG and GIF are found to be inappropriate for document images. Although such image 
compression standards are suitable for continuous-tone images (i.e. for most pictures of 
natural scenes), these standards are not suitable for sharp edges of character images. 
On the other hand, a scanned document tends to be quite large if one wants to preserve 

30 the readability of the text. 

It is with the knowledge of the above mentioned background and concerns 
that the present invention has been made and is now reduced to practice. 
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SUMMARY 

In accordance with a first aspect of the present invention there is 
5 provided a method of encoding a document image, the method comprising extracting 
one or more picture areas from the document image; extracting one or more character 
areas from the document image; obtaining a background image by subtracting the image 
and character areas from the document image; classifying character blocks of the 
character areas with reference to dynamically generated templates; and encoding the 
1 0 background image utilising a SAQ wavelet encoder. 

The extracting of the picture areas and/or the character areas may comprise 
marking blocks partitioned from the document image based on features of wavelet 
coefficients of the respective blocks. 

15 

The extracting of the pictures areas may comprise a hierarchical extraction 
comprising extracting picture blocks from the document image to generate one or more 
initial picture areas and refining the initial picture areas by extracting picture pixels 
adjacent to the initial picture areas. 

20 

The extracting of the character areas from the document image may comprise 
utilising a customised definition of the connectivity of the pixels. 

The method may further comprise generating style data as a description of the 
25 templates and character blocks. 

The classifying the character blocks may comprise a hierarchical matching 
comprising matching the style of each character block based on the style data and then 
matching each character block against selected ones of the templates based on the 
30 style data matching. 

The classifying of the character blocks based on the templates may comprise 
morphological matching. 
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The morphological matching may comprise matching algorithms Mi and M 2 , 

Different structure elements may be utilised for different types of document 
images. 

5 

The method may further comprise bit plane storage of a compressed stream of 
the document image in the order of character areas, picture area and background image 
for progressive decoding. 

10 In accordance with a second aspect of the present invention there is provided a 

method of decoding a compressed document image stream, the method comprising 
extracting one or more picture areas from the compressed document image stream; 
extracting one or more character areas from the compressed document image stream; 
extracting a background image from the compressed data image stream; identifying 

15 character blocks of the character areas with reference to dynamically generated 
templates in the compressed document image stream; decoding the background image 
utilising a wavelet based SAQ method; and constructing a decoded document image by 
adding the picture areas, the character areas and the background image. 

20 In accordance with a third aspect of the present invention there is provided a 

computer readable data storage medium having stored thereon code means for 
instructing a computer to execute a method of encoding a document image, the method 
comprising extracting one or more picture areas from the document image; extracting 
one or more character areas from the document image; obtaining a background image 

25 by subtracted the image and character areas from the document image; classifying 
character blocks of the character areas with reference to dynamically generated 
templates; and encoding the background image utilising a wavelet based SAQ method. 

In accordance with a fourth aspect of the present invention there is provided a 
30 computer readable data storage medium having stored thereon code means for 
instructing a computer to execute a method of decoding a compressed document image 
stream, the method comprising extracting one or more picture areas from the 
compressed document image stream; extracting one or more character areas from the 
compressed document image stream; extracting a background image from the 
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compressed data image stream; identifying character blocks of the character areas with 
reference to dynamically generated templates in the compressed document image 
stream; decoding the background image utilising' a wavelet based SAQ method; and 
constructing a decoded document image by adding the picture areas, the character 
5 areas and the background image. 

In accordance with a fifth aspect of the present invention there is provided a 
system for encoding a document image, the system comprising means for extracting one 
or more picture areas from the document image; means for extracting one or more 
10 character areas from the document image; means for obtaining a background image by 
subtracted the image and character areas from the document image; means for 
classifying character blocks of the character areas with reference to dynamically 
generated templates; and means for encoding the background image utilising a wavelet 
based SAQ method. 

15 

In accordance with a sixth aspect of the present invention there is provided a 
system for decoding a compressed document image stream, the system comprising 
means for extracting one or more picture areas from the compressed document image 
stream; means for extracting one or more character areas from the compressed 

20 document image stream; means for extracting a background image from the, 
compressed data image stream; means for identifying character blocks of the character 
areas with reference to dynamically generated templates in the compressed document 
image stream; means for decoding the background image utilising a wavelet based SAQ 
method; and means for constructing a decoded document image by adding the picture 

25 areas, the character areas and the background image. 

BRIEF DESCRIPTION OF THE DRAWINGS 

30 Embodiments of the invention will be better understood and readily apparent 

to one of ordinary skill in the art from the following written description, by way of 
example only, and in conjunction with the drawings, in which: 

Figure 1 shows a block diagram illustrating an encoder process in an 
example embodiment. 
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Figure 2 shows a block diagram illustrating a decoder process in an example 
embodiment. 

Figure 3 shows a block diagram illustrating an image block extractor process 
in an example embodiment. 

Figure 4 shows a block diagram illustrating a process for clustering of 
character images in an example embodiment. 

Figure 5 is a schematic drawing illustrating a computer system for 
implementing the method and system of an example embodiment. 

DETAILED DESCRIPTION 

Embodiments of the present invention provide an image compression technique 
for classifying, matching and identifying document images based on a wavelet 
compression method. This method may be referred to as a wavelet document image 
compression (WDIC) method. More specifically, in embodiments of the present 
invention, the character and picture components may be separated from the 
backgrounds of one or more original document images and different methods used to 
compress each of those components. More generally, embodiments of the present 
invention may also be applied to other special documents such as particularly important 
historical documents, scientific papers with mathematical or chemical formulae, software 
documents and some handwritten signatures. 

Embodiments of the present invention provide an - approach for compression of 
document images enabling a high-quality version of one page of a document image to 
be transferred at very high compression ratios. 

The example embodiments comprise a number of novel algorithms for an 
improved document image compression method. Two main categories of picture areas 
and character areas may be extracted from the document image and the background 
image encoded by subtracting these two category areas from document images, in the 
example embodiments. 
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The character image may be encoded with an extent-based morphological 
matching, clustering and wavelet compression algorithm. A picture image may be 
encoded with a wavelet-based compression algorithm, which is suitable for grey scale 
images. A background image may also be encoded with a wavelet-based successive 
approximation quantization (SAQ) compression algorithm. 

WDIC, in example embodiments, is a progressive code. WDIC provides 
progressive decoding not only on background images, but also on character images. 

In the following sections the example embodiments are described. The features 
of WDIC of the example embodiments comprise special image segmentation for a 
document image, fast classification, a morphological matching and clustering algorithm 
for character images, and a wavelet-based compression algorithm for picture images. 
Results from an actual implementation experiment showed a significant performance 
improvement over prior art methods in respect of two aspects. Firstly, WDIC allows a 
highly efficient compression format and secondly, a progressive range of compression 
rate scalability to be achieved in the example embodiments. 

It is assumed that the intensity of background pixels has a possible maximum 
intensity value 7 max and the intensity of characters, pictures is positive in the example 
embodiment Firstly, the image is posterized into 3 levels as below. 

0 whenI(y)>T 0 
F(v) = <\ whenT x <I(y)<T Q , 

2 whenI(v)<T x 
where I(y) is the intensity of the pixel at v = (v x , v y ), and T x = 7 max /2, 
where T 0 is calculated in step 301 . 

The following algorithm is performed at all untraced pixels u with F(u ) = 2. 

1. S=<j>, S t =[u) 9 fT = ~xC, 
C is slightly larger than font size of most characters/letter, (default C = 24) 
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2. Find veS v {v; } represent eight neighbor pixels ofvin 
clockwise order, among them {v, , v 3 , v 5 , v 7 } are 4-neighbor pixels of v. 
Define v Mk = v l ,k&Z.S = S\J{v), S l =>S l \ {v} 

3. forv i ,i = l,—,8, \v ix -u x \^W,and \v ty -u y \zW 
5 a. if i= 1,3,5,7, 

i. if F(v) = 2 and (F(v, ) = 2or (F(v, ) = land F(v,.j ) + F(v M ) £ 
thenS x =S l U{v,} 

ii. if F(v) = 1 and (F( Vi ) = 2 and (F(v,. 2 )+F(v M ) * 2)) 
thenS i =S l \J{v t ) 

10 b. if i=2,4,6,8, 

if F(v) = 2 and (F(v, ) = 2 and F(v M ) + F(v M )>1) then S X =S X \J {v, } 

4. if S { go to step 2 

5. A = {(x,y) | x min <x< x^y^ <y<y max } represents a 
character image block. 

1 5 where x mln = min{ v x }, x max = max{ v x }, y min = min{ v y max = max{ v } 

veS reS reS veS 7 

After character image block A is extracted and saved into the character block list, 
the pixels in this block are marked as the traced pixels and the value of the pixels in the 
character list block are changed to 255. The same procedure starts from untraced pixels 
satisfying F(u) = 2 until no such pixel exists. 

20 

The character images 108 are the blocks representing the lines and characters 
extracted at step 107 from the residue image 106 in the example embodiment. Process 
step 109 clusters the character images 108 hierarchically. Step 109 will be described in 
further detail in steps 401 to 413 below. Process step 109 outputs data 110 comprising 
25 the character template library and the code of every character block outputted from step 
109. The code of the character blocks includes the absolute coordinates of the block in 
the original image 101 and the index of the template the block uses. At step 111 the 
process of Figure 1 encodes the character codes of the character blocks and character 
template library by an SAQ encoder in the example embodiment. 



30 
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The output 112 of step 111 is a compressed bit stream for the characters. While 
the data 112 is passed to the process step 118, the data will be decoded by a decoder at 
step 113 which is the counterpart of the SAQ encoder used at step 104. The 
reconstructed character images 114 are used to generate the background image 115. 
Process step 116 utilises a SAQ wavelet encoder for grey scale images in the example 
embodiment. Reference is made to J.M.Shapiro, "Embedded Image Coding Using 
Zerotrees of Wavelet Coefficients", IEEE Trans. On Signal Processing, Vol. 41, No. 12, 
Dec, 1993, pp. 3445-3426 and to Said, and W. A. Pearlman, "A New Fast and Efficient 
Image Codec Based on Set Partitioning in Hierarchical Trees", IEEE Trans, on Circuits 
and Systems for Video Technology, Vol. 6, No. 3, June 1996, pp. 243-250 for details of a 
suitable encoding process. 

The compressed bit stream 117 for the background image 115 is passed to the 
process step 118. The process step 118 organizes the compressed bit stream of picture 
image blocks, character image blocks and the background image to generate the 
compressed data bit stream 119 for the whole document image. 

The compressed data bit stream 119 is organized as described in the following 
example embodiment. The document image header and character codes of character 
blocks and location information of picture image blocks are saved first. Then the 
compressed stream for the first two most significant bit planes of a character template 
library are saved. In the compressed stream of pictures and background images, the 
stream for the bit planes whose value is greater or equal than the value of the second 
most significant bit plane of character pictures is saved next. Finally the remaining 
compressed streams for characters, pictures and background image are added one bit 
plane followed by another from the most significant one to the least significant one in an 
interlaced manner. This interlaced pattern saves compressed stream of character 
templates first and then the stream of pictures and background for the same bit plane, in 
the example embodiment. Such organization may ensure the progressive decoding of 
the document image in the example embodiment. In other words, one can obtain the 
document image from coarsest version to the finest version. 

The picture image block extractor process step 102 is described in further detail 
in the following, with reference to Figure 3. 



WO 2005/046213 



9 



PCT/SG2004/000366 



As seen in Fig. 3, process step 301 estimates the peak value P 0 of the histogram 
of document image, threshold T Q = (T x + P 0 )/2, the pixels of intensity of pixel less than 
T 0 are classified as foreground pixel, other pixels are background pixels. 

5 

Process step 302 partitions the entire document image into blocks with size 
WxW where W = 2 l z J and r is at scanned resolution. 

Process step 303 classifies blocks in to two types: picture blocks marked by 1 
10 and nonpicture blocks marked by 0. The classification is based on the statistical features 
of wavelet decomposition of blocks. The procedure is as following in the examples 
embodiment 

Using the wavelet filter to decompose the block once as conventional wavelet 
15 decomposition of image. For the computation efficiency, the sum of filter coefficients is 2 
and the suggested filter for this procedure is a Haar wavelet filter. The diagram below 
shows this procedure. LL, LH, HL and HH are the notations of lowest frequency 
component to highest frequency component as will be appreciated by a person skilled in 
the art. 

20 



L 


H 


L 


H 



In general, a document image is typically composed of a large portion of 
characters and edge regions, together with a relatively smaller portion of homogeneous 
regions. Homogeneous regions have the least variation. Characters regions have 
25 moderate variation; and lines show the most variation. 
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, v fl when \c\>A 
£( C ) = L , . where A is a predefined threshold (default A=16 for 
[0 otherwise 

images with 256 as maximum intensity value, in an example embodiment) and c is the 
wavelet coefficients. 



The statistical variable used in the classification is as follows: 

Hs(c u ) 

countH = (u) ^ — , whereH = HL (JLH U HH 

average^ = (iJ >* LL — where S u is the total number of wavelet coefficients of LL 

If count H <B and average LL <(T Q +1^12, where B is the predetermined 

threshold whose default value is 3, the block is marked as picture block, otherwise the 
block is marked as nonpicture block. 

Switch 304 checks whether untraced picture blocks exist. If the answer is NO, all 
picture blocks are saved in data 316 already and the process 102 of Fig. 3 finishes. 

Otherwise, the next untraced picture block is identified in step 305 the mark of 
the next untraced picture block is changed to zero, and the picture area is initialised to 
the minimum rectangle containing current block in the next process step 306. 

The process step 317 as seen in Fig. 3 extracts the rectangle area of the picture 
image and consists of two steps in the example embodiment Firstly, process step 318 
extracts the picture blocks from the picture image. Then the rectangle area will further 
grow to neighbouring pixels in process step 319 if necessary, as seen in Fig. 3. 

Switch 307 checks whether there is a neighbour block of the current picture area 
whose mark is 1. If the answer is YES, the neighbour block is marked 0 in step 308 and 
the picture area is extended to a new rectangle area containing this block in process 
step 309, the process step 318 returns to switch 307. If the answer is NO, all neighbour 
blocks are not picture blocks. Process step 318 is completed and the process of Fig. 3 
proceeds to switch 310. 
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Switch 310 checks whether the rectangle picture area is big enough by 
comparing the length and width to the preset value (default 2W ). If answer is NO, there 
is no picture area found and the process of Fig. 3 turns to switch 304. Otherwise, the 
answer is YES, and the location information of the picture area is stored in step 311. 

Process step 319, as seen in Fig. 3, comprising the following steps refines the 
picture area in the example embodiment. Switch 312 checks whether there is a fore- 
pixel in the neighbour pixels of current area. If the answer is YES, process step 313 
extends the picture area to the new rectangle picture area containing the found fore-pixel 
and the process 319 returns to switch 312. If the answer is NO, all neighbour pixels of 
current picture area are back-pixels. Process step 319 finishes and this rectangle picture 
is saved as a picture image area in process step 314. Process step 315 appends this 
picture image area to the list of picture images and the process of Fig. 3 returns to 
switch 304. 

With reference to Figure 4, process step 401 generates the style of characters 
Q = {(f,J) | I t (iJ) < T 29 i = 0,1,..., h -1, J = 0,1,..., 

where is intensity of pixel at coordinates (ij) in character image block and 

block distance of two pixels is defined as d((i u j\), (i 2 J 2 )) =| i 2 -ij | + 1 j 2 - j\ | . 
Then the style of this character is defined as {^h i d lt ,d lbi d rn d rb ) where 

d * = minW((o > o),(/,y))) - di b =min(d((M-V,(iJ))) 

d n =min(d((™~W),(hj))) . d rb = m i n (d((w-i,h-i) t (ij))) 

Three sets L 0 ,L if L 2 may be defined for process step 109. L 0 is the collection of 
character images blocks. L x is the collection of the character code of the character 
image blocks consisting of the index of the matched character template in the character 
template library and the locations of the character image blocks. L 2 is library of 
character templates used to save the images of character templates. Switch 402 checks 
whether L 0 is empty, if the answer is YES, all character blocks have been processed, 
then data 403 comprising L x and L 2 will be outputted and the process 109 concludes. 
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Otherwise, the answer is NO in step 402, the next character block Tin L 0 is retrieved in 
process step 404. Process step 404 is the process of matching character block T against 
templates in L 2 . Starting from the head of L 2 , check whether all templates in L 2 have 
been used at switch 406. 

If the answer is YES, T is a new type of character, in step 407 T is appended to 
L 2 as a new character template 71, the code information of T against TL is saved to L v 
and T is removed from L Ql then the process returns to switch 402. 

Otherwise, if the answer is NO in 406, the character template TL is retrieved from 
L 2 in step 408. T is matched against TL by two steps, first match T against TL in 
process step 409. In process step 409, we compute the absolute values of differences of 
all entries between style of T and style of TL.. Switch 410 checks the result of process 
409, if one of the absolute values is greater equal than predetermined threshold, the 
answer is NO, the process of Fig. 4 proceeds to step 406. If the answer is YES, then 
match T again st TL by morphological character matching method in process step 411. 

Proess step 411 uses a morphological approach in the example embodiment 
with which the matching of two characters is fast and accurate compared to conventional 
matching methods such as matching by grey scale similarity. The new measurement 
based on morphological approach in the example embodiment may perform better than 
Euclidean distance measurement and Hausdorff measurement in the case of a noisy 
environment due to the stability of the measurement. 

The morphological operator in the example embodiment measures the size of the 
difference image of two images (i.e., one is the template and the other is character 
block). Assume the two images are f and g, the difference image f-g is defined as 
follows: 




1, F((x t y) f )+F((x,y) g )< 4 and \f(x,y)-g(x,yj\ > C, 



0, otherwise 



threshold C M = 32. 
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The difference image f-g is a binary image. In other words, the difference image 
f-g is a binary set. 

The size of set A of structure element B may be defined as 

e( A) B = sup{ AoaB *<f>},a e 9t where A°B is normal morphological open operator, 

a 

The new measurement of the difference between two binary sets may be defined 
as s b (f> S) = e(f - g) B » where B is square structure element of size 1. 

The similarity measure of two sets/, g is M(f,g) = mzx{S B (f-g\ S B (g-f)}. 
The new measurement is symmetric in the sense of the distortion is concave distortion 
or convex distortion; however, the Hausdorff measurement is not symmetric, as will be 
appreciated by the person skilled in the art. Reference is made to W. Gong, Q. Y. Shi, 
and M. D. Cheng, Shape and image matching by use of morphology, Proc. 11 th Int. 
Conf. On Pattern Recognition, vol. 2, 673—676, The Hague, The Netherlands, 1992 

If the measure is less than the average size of the noise region, the matching is a 
success. A fast algorithm may be defined in the example embodiment based on this 
theory for matching of a character problem. The measure of the difference is modified as 
M(f,g) = S B (f-g). For the matching of the character image, e.g. with resolution no 
less than 72, if the measure is less than 2, the matching of character against template is 
a success. The algorithm may be defined as follows: 

Algorithm M 1 

1. Suppose (f - g)(x) is a sequence with length m. x<r-0, 

2. if(f-g)(x) = 0,gotostep5 

3. if(f-g)(x + V = 0 / (f-g)(x) ^0,gotostep5 

4. x*-x + l 

5. if(x<m-\) x<r- Jt + 1, go to step 2 

6. end 
Algorithm M 2 

1 Suppose (f -g)(x)isa sequence with length m. x <- 0, 

2. if (f g)(x) = 1 and (f - g)(x + 1) = 1, go to step 5 



WO 2005/046213 



14 



PCT/SG2004/000366 



3. if(x<m-\) x<r-x + l, gotostep2 

4. character matches against template, go to step 6 

5. character does not match against template, go to step 6 

6. end 

The condition is weak or does not depend on the structure element used in the 
algorithm and the associated part of algorithm M 2 in the example embodiment. Here 
the condition is strong means that matching a character against template is difficult. On 
the contrary, the conditions is weak means matching a character against a template is 
very easy. Strong condition will decrease the compression ratio slightly but weak 
condition will generate false matching and the reconstructed character may not be 
correct when the scanned document image quality is very poor. The order of line, circle 
to square corresponds to the conditions from strong to weak. 

The following is noted: 

If algorithm Mi performs only along row direction, the structure element used in 
the matching algorithm is a line of horizontal direction in the example 
embodiment. This element is found to be good enough for e.g. English 
character matching. 

If algorithm M 1 performs only along column, the structure element used in the 
matching algorithm is a line of vertical direction in the example embodiment. 
If algorithm Mi performs along both row and column directions, the structure 
element used in the matching algorithm is circle in the example embodiment 
Circle structure element is found to work well for character of most languages. 
If algorithm performs along row direction followed by column direction and 
then performs along column direction followed by row direction, the structure 
element used in the matching algorithm is square in the example embodiment. 
For the structure element of lines we may only need to apply algorithm M 2 along 
same direction as Mi does. For the structure element circle algorithm M 2 
performs at either horizontal direction or vertical direction in the example 
embodiment. For the structure element squane, algorithm M 2 performs at both 
horizontal and vertical directions before we conclude that the match is success in 
the example embodiment. 
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Returning to the process of Fig. 4, switch 412 checks whether T matches against 
TL If the answer is NO, the process of Fig. 4 returns to switch 406. Otherwise, the 
answer is YES, information of 7" is appended to i, and code of T is index of pattern TL 
in Z^, then the process step 413 removes 7 from L 0 , and then the process of Fig. 4 
proceeds to switch 402. 

Figure 2 illustrates the decoder process 200 in the example embodiment, that is 
the reverse process of the encoder 100. Decoder process 200 begins from a 
compressed bit stream 201 of the document image. Process step 202 separates the bit 
stream 201 into three parts based on the formats of the compressed document image 
described in step 118. These three parts are a compressed bit stream 203 of 
background image, compressed bit stream 206 of character image blocks and 
compressed bit stream 209 of picture image blocks. 

Data 203 is decoded by wavelet based SAQ decoder 204 to generate the 
background image. Data 206 is decoded by a character decoder at process step 207 to 
generate the information of character codes of character image blocks and character 
template library. Data 209 is decoded by SAQ wavelet decoder at step 210 to generate 
the picture image blocks 211. Data 205, 208 and 211 may be combined to generate the 
document image 213 in process step 212. 

The method and system of the example embodiment can be implemented on 
a computer system 500, schematically shown in Figure 5. It may be implemented as 
software, such as a computer program being executed within the computer system 
500. 

The computer system 500 comprises an extracting module 501, which 
extracts the characters 501, pictures 502 and background image 503. The character 
encoding module 505 and picture encoding module 506 and background image 
encoding module 507 compress the characters 501, pictures 502 and background 
image 503 respectively. The output compressed stream is interlaced in module 508. 
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The compressed bit stream 509 is the compressed document image 
generated by document image encoding. 

It will be appreciated by a person skilled in the art that numerous variations 
and/or modifications may be made to the present invention as shown in the specific 
embodiments without departing from the spirit or scope of the invention as broadly 
described. The present embodiments are, therefore, to be considered in all respects to 
be illustrative and not restrictive. 
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CLAIMS 

1 . A method of encoding a document image, the method comprising: 
extracting one or more picture areas from the document image; 
extracting one or more character areas from the document image; 

obtaining a background image by subtracting the image and character areas 
from the document image; 

classifying character blocks of the character areas with reference to dynamically 
generated templates; and 

encoding the background image utilising a SAQ wavelet encoder 

2. The method as claimed in claim 1, wherein the extracting of the picture 
areas and/or the character areas comprises marking blocks partitioned from the 
document image based on features of wavelet coefficients of the respective blocks. 

3. The method as claimed in claims 1 or 2, wherein the extracting of the 
pictures areas comprises a hierarchical extraction comprising extracting picture blocks 
from the document image to generate one or more initial picture areas and refining the 
initial picture areas by extracting picture pixels adjacent to the initial picture areas. 

4. The method as claimed in any one of claims 1 to 3, wherein the extracting 
of the character areas from the document image comprises utilising a customised 
definition of the connectivity of the pixels. 

5. The method as claimed in any one of claims 1 to 4, further comprising 
generating style data as a description of the templates and character blocks. 

6. The method as claimed in claim 5, wherein the classifying the character 
blocks comprises a hierarchical matching comprising matching the style of each 
character block based on the style data and then matching each character block against 
selected ones of the templates based on the style data matching. 

7. The method as claimed in any one of claims 1 to 6, wherein the 
classifying of the character blocks based on the templates comprises morphological 
matching. 
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8. The method as claimed in claim 7, wherein the morphological matching 
comprises matching algorithms and M 2 , 

9. The method as claimed in claim 8, wherein different structure elements 
are utilised for different types of document images. 

10. The method as claimed in any one of claims 1 to 9, further comprising bit 
plane storage of a compressed stream of the document image in the order of character 
areas, picture area and background image for progressive decoding. 

11. A method of decoding a compressed document image stream, the 
method comprising: 

extracting one or more picture areas from the compressed document image 

stream; 

extracting one or more character areas from the compressed document image 

stream; 

extracting a background image from the compressed data image stream; 

identifying character blocks of the character areas with reference to dynamically 
generated templates in the compressed document image stream; 

decoding the background image utilising a wavelet based SAQ method; and 

constructing a decoded document image by adding the picture areas, the 
character areas and the background image. 

12. A computer readable data storage medium having stored thereon code 
means for instructing a computer to execute a method of encoding a document image, 
the method comprising: 

extracting one or more picture areas from the document image; 

extracting one or more character areas from the document image; 

obtaining a background image by subtracted the image and character areas from 
the document image; 

classifying character blocks of the character areas with reference to dynamically 
generated templates; and 

encoding the background image utilising a wavelet based SAQ method. 
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13. A computer readable data storage medium having stored thereon code 
means for instructing a computer to execute a method of decoding a compressed 
document image stream, the method comprising: 

extracting one or more picture areas from the compressed document image 

stream; 

extracting one or more character areas from the compressed document image 

stream; 

extracting a background image from the compressed data image stream; 

identifying character blocks of the character areas with reference to dynamically 
generated templates in the compressed document image stream; 

decoding the background image utilising a wavelet based SAQ method; and 

constructing a decoded document image by adding the picture areas, the 
character areas and the background image. 

14. A system for encoding a document image, the system comprising: 
means for extracting one or more picture areas from the document image; 
means for extracting one or more character areas from the document image; 
means for obtaining a background image by subtracted the image and character 

areas from the document image; 

means for classifying character blocks of the character areas with reference to 
dynamically generated templates; and 

means for encoding the background image utilising a wavelet based SAQ 
method. 

15. A system for decoding a compressed document image stream, the 
system comprising: 

means for extracting one or more picture areas from the compressed document 
image stream; 

means for extracting one or more character areas from the compressed 
document image stream; 

means for extracting a background image from the compressed data image 

stream; 
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means for identifying character blocks of the character areas with reference to 
dynamically generated templates in the compressed document image stream; 

means for decoding the background image utilising a wavelet based SAQ 
method; and 

means for constructing a decoded document image by adding the picture areas, 
the character areas and the background image. 
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